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We present a measurement of the top quark mass in the all-hadronic channel (ti — + bb (71(72(73(74) 
using 943 pb _1 of pp collisions at y/s = 1.96 TeV collected at the CDF II detector at Fermilab 
(CDF). We apply the standard model production and decay matrix-element (ME) to it candidate 
events. We calculate per-event probability densities according to the ME calculation and construct 
template models of signal and background. The scale of the jet energy is calibrated using additional 
templates formed with the invariant mass of pairs of jets. These templates form an overall likelihood 
function that depends on the top quark mass and on the jet energy scale ( JES) . We estimate both 
by maximizing this function. Given 72 observed events, we measure a top quark mass of 171.1 ± 3.7 
(stat.+JES) ± 2.1 (syst.) GeV/c 2 . The combined uncertainty on the top quark mass is 4.3 GeV/c 2 . 

PACS numbers: 14.65.Ha, 12.15.Ff, 13.85. Ni 



* Deceased 

tWith visitors from a University of Massachusetts Amherst, 
Amherst, Massachusetts 01003, ^Universiteit Antwerpen, B-2610 



Antwerp, Belgium, c University of Bristol, Bristol BS8 1TL, 
United Kingdom, ^Chinese Academy of Sciences, Beijing 100864, 
China, e Istituto Nazionale di Fisica Nucleare, Sezione di Cagliari, 



4 



I. INTRODUCTION 

The top quark plays an important role in particle 
physics. Being the heaviest observed elementary parti- 
cle results in large contributions to electroweak radiative 
corrections and provides a constraint on the mass of the 
elusive Higgs boson. More accurate measurements of the 
top quark mass are important for precision tests of the 
standard model. In addition, having a Yukawa coupling 
close to unity may indicate a special role for this quark in 
electroweak symmetry breaking. Increasing the precision 
on the mass of the top quark is central not only for the 
standard model, but also for other theoretical scenarios 
beyond the standard model. 

At the Tevatron the top quark is produced most fre- 
quently via the strong interaction yielding a top/anti-top 
pair. Once produced, the top quark decays into a b quark 
and a W boson about 99% of the time according to the 
standard model. Based on the decay mode of the two 
W bosons the ti events can be divided in three chan- 
nels: the dilepton channel when both W bosons decay 
to leptons; the lepton+jets channel when one W boson 
decays to leptons and the other one decays to hadrons; 
and the all-hadronic channel when both W bosons decay 
to hadrons. 

This paper reports a measurement of the top quark 
mass in the all-hadronic channel using 943 pb" 1 collected 
with the upgraded CDF II detector at Fermilab. In Sec- 
tion[TT]we give a brief description of the CDF II detector. 

The all-hadronic final state consists of six jets, two of 
which are due to the hadronization of b quarks. The large 
QCD background and jet-parton combinatorics make 
measurements more difficult in this channel than in the 
others. However, because there are no unobserved final- 
state particles, it is possible to fully reconstruct all- 
hadronic events. In order to enhance the ti content over 
the background, special selection criteria are imposed on 
the kinematics and topology of the events. In Section [TTT1 
we give more details on this selection. 

Previous mass measurements of the top quark in the 
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all-hadronic channel were performed at CDF in both Run 
I [l[ and Run II Q. For the first time in this channel, 
we measure the mass of the top quark utilizing a tech- 
nique that uses the matrix element for ti production and 
decay. The details of the matrix element calculation and 
implementation are given in Section ITVl 

The matrix element is used to calculate a probability 
for each candidate event to be produced via the stan- 
dard model ti production mechanism. In principle, the 
mass of the top quark can be determined by maximizing 
this probability, and such a technique was successfully 
applied before at CDF in the lepton+jets channel [3j and 
in the dilepton channel 4] . In this analysis we take a new 
approach in that we calculate the matrix element prob- 
ability in samples of simulated ti events to build and to 
parameterize top mass templates. These are distribu- 
tions that depend on the mass of the top quark, unlike 
the templates for background events whose modeling is 
described in Section [Vj The measured value of the mass 
of the top quark corresponds to a ti template whose mix- 
ture with a background template best fits the data. In 
Section IVII we give more details on how these templates 
are built. 

Besides considering a matrix element for a different 
ti decay channel, in this analysis the matrix element is 
computed differently than in the aforementioned analyses 
in the leptonic channels. Also, the features of the matrix 
element probability are exploited to improve the event 
selection. 

The uncertainty on the jet energy scale has the largest 
contribution to the total uncertainty in most top quark 
mass measurements. In order to minimize this effect, 
we perform an in situ calibration of the jet energy scale 
using the invariant mass of pairs of light flavor jets. For 
ti events this variable is correlated with the mass of the 
W boson, and it is sensitive to variations in jet energy 
scale. Using this invariant mass we build the dijet mass 
templates, and we use them for the calibration of the jet 
energy scale as shown in Section IVII This procedure, 
used previously at CDF for the mass measurement of 
the top quark in the lepton+jets channel [5j, is used for 
the first time in the all-hadronic channel in the analysis 
described in this paper. 

The result of the data fit is the topic of Section IVII1 
while in Section IVIIII the associated systematic uncer- 
tainties are described. Finally, Section ILXl concludes the 
paper. 



II. DETECTOR 

The CDF II detector is an azimuthally and forward- 
backward symmetric apparatus designed to study pp col- 
lisions at the Tevatron. It is a general purpose detector 
which combines precision particle tracking with fast pro- 
jective calorimetry and fine grained muon detection. 

The CDF coordinate system is right handed, with z 
axis tangent to the Tevatron ring and pointing in the di- 
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rection of the proton beam. The x and y coordinates of a 
left-handed x,y, z Cartesian reference system are defined 
pointing outward and upward from the Tevatron ring, 
respectively. The azimuthal angle <j> is measured relative 
to the x axis in the transverse plane. The polar angle 
is measured from the proton direction and is typically 
expressed as pseudorapidity t] = — ln(tan|). We define 
transverse energy as Et — EsmO and transverse momen- 
tum as px = psmO where E is the energy measured in 
the calorimeter and p is the magnitude of the momentum 
measured by the tracking system. 

Tracking systems are contained in a superconducting 
solenoid, 1.5 m in radius and 4.8 m in length, which gen- 
erates a 1.4 T magnetic field parallel to the beam axis. 
The calorimeter surrounds the solenoid. A more com- 
plete description of the CDF II detector can be found in 
Ref. @ . The main features of the detector systems are 
summarized below. 

The tracking system consists of a silicon microstrip sys- 
tem and an open-cell wire drift chamber that surrounds 
the silicon. The silicon microstrip system consists of eight 
layers in a cylindrical geometry that extends from a ra- 
dius of r = 1.35 cm from the beam line to r = 29 cm. The 
layer closest to the beam pipe is a radiation-hard, single 
sided detector called Layer 00 @- The remaining seven 
layers are radiation-hard, double sided detectors. The 
first five layers after Layer 00 comprise the SVXII Q 
system and the two outer layers comprise the ISL sys- 
tem. This entire system allows track reconstruction in 
three dimensions. The resolution on the impact parame- 
ter for high-energy tracks with respect to the interaction 
point is 40 //m, including a 30 /im contribution from the 
beam-line. The resolution to determine zo (z position 
of the track at point of minimum distance to interac- 
tion vertex) is 70 um. The 3.1 m long cylindrical drift 
chamber (COT) [K| covers the radial range from 43 to 
132 cm and provides 96 measurement layers, organized 
into alternating axial and ±2° stereo superlayers. The 
COT provides full coverage for |?7| <1. The hit position 
resolution is approximately 140 /im and the transverse 
momentum resolution <j(pt)/Pt — 0.0015 GeV/c -1 . 

Segmented electromagnetic and hadronic sampling 
calorimeters surround the tracking system and measure 
the energy flow of interacting particles in the pseudora- 
pidity range \rj\ < 3.6. The central calorimeters (and 
the end-wall hadronic calorimeter) cover the pseudora- 
pidity range \rj\ < 1.1(1.3) and are segmented in towers 
of 15° in azimuth and 0.1 in r\. The central electromag- 
netic calorimeter [ll[ uses lead sheets interspersed with 
polystyrene scintillator as the active medium and photo- 
multipliers. The energy resolution for high-energy elec- 
trons and photons is 13.5%/ ' \/E T ®2%, where the first 
term is the stochastic resolution and the second term is 
a constant term due to the non-uniform response of the 
calorimeter. 
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The central hadronic calorimeter 
steel absorber interspersed with acrylic scintillator as the 
active medium. The energy resolution for single-pions 
is 75%/%/i?Tffl3% as determined using the test-beam 



data. The plug calorimeters cover the pseudorapidity 
region 1.1 < |?7| < 3.6 and are segmented in towers of 
7.5° for |?7| < 2.1 and 15° for \r]\ > 2.1. They are sam- 
pling scintillator calorimeters coupled with plastic fibers 
and photomultipliers. The energy resolution of the plug 
electromagnetic calorimeter [l3[ for high-energy electrons 
and photons is « 16%/ \/E T (Bl%. The energy resolu- 
tion for single-pions in the plug hadronic calorimeter is 
« 74%/v A E T ®4%. 

The collider luminosity is proportional to the average 
number of inelastic pp collisions per bunch crossing which 
is measured using gas Cherenkov counters [l4[ located in 
the 3.7 < I77I < 4.7 region. 

The data selection (trigger) and data acquisition sys- 
tems are designed to accommodate the high rates and 
large data volume of Run II. Based on preliminary infor- 
mation from tracking, calorimetry, and muon systems, 
the output of the first level of the trigger (level 1) is used 
to limit the rate of the accepted events to « 18 kHz at 
the luminosity range 3— >7 x 10 31 cm~ 2 s _1 . At the next 
trigger stage (level 2), with more refined information and 
additional tracking information from the silicon detector, 
the rate is reduced further to ~ 500 Hz. The final level 
of the trigger (level 3), with access to the complete event 
information, uses software algorithms and a farm of com- 
puters to reduce the output rate to w 100 Hz, which is the 
rate at which events are written to permanent storage. 



III. DATA SAMPLE AND EVENT SELECTION 

The expected signature of a tt event in the all-hadronic 
channel {tt — » bb <7ig~2<73<74) is the presence of six jets in 
the reconstructed final state. Jets are identified as clus- 
ters of energy in the calorimeter using a fixed-cone algo- 
rithm with radius 0.4 in rj-cj) space [15] ■ The energy of the 
jets needs to be corrected for various effects back to the 
energy of the parent parton. The CDF jet energy cor- 
rections are divided into several levels to accommodate 
different effects that can distort the measured jet energy: 
non-uniform response of the calorimeter as a function of 
7], different response of the calorimeter to different parti- 
cles, non-linear response of the calorimeter to the particle 
energies, uninstrumented regions of the detector, multi- 
ple pp interactions, spectator particles, and energy radi- 
ated outside the jet clustering cone. In this analysis we 
correct the energy of the jets taking into account all of 
the above effects except those due to spectator particles 
and energy radiated outside the cone. These additional 
corrections are recovered using the transfer functions de- 
fined in Section ITVl 

A detailed explanation of the procedure to derive the 
various individual levels of correction is described in 
Ref. [IH . Briefly, the calorimeter tower energies are first 
calibrated as follows. The scale of the electromagnetic 
calorimeter is set using the peak of the dielectron mass 
resonance resulting from the decays of the Z boson. For 
the hadronic calorimeter we use the single pion test beam 
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data. This calibration is followed by a dijet balancing 
procedure used to determine and correct for variations in 
the calorimeter response to jets as a function of r\. This 
relative correction ranges from about -10% to +15%. Af- 
ter tuning the simulation to reflect the data, a sample of 
simulated dijet events generated with pythia [l7j is used 
to determine the correction that brings the jet energies 
to the most probable true in-cone hadronic energy. The 
absolute correction varies between 10% and 40%. 

A systematic uncertainty on these corrections is de- 
rived in each case. Some are in the form of uncertainties 
on the energy measurement themselves, and some are 
uncertainties on the detector simulation. Typical over- 
all uncertainty is in the range of 3% to 4% for jets with 
transverse momentum larger than 40 GeV/c. More de- 
tails on the estimation of these uncertainties can be found 
in Q. 

The data sample is selected using a dedicated multi- 
jet trigger defined as follows. For triggering purposes the 
calorimeter granularity is simplified to a 24 x 24 grid in 
Tj-<p space. A trigger tower spans approximately 15° in <p 
and 0.2 in r\ covering one or two physical towers. At level 
1, we require at least one trigger tower with transverse 
energy Etf w > 10 GeV. At level 2, we require the sum of 
the transverse energies of all the trigger towers, E^ w , 
be > 175 GeV and the presence of at least four clusters 
of trigger towers with E^ s > 15 GeV. Finally, at level 3 
we require four or more reconstructed jets with Et > 10 
GeV. This trigger selects about 80% of the ti events in 
the all-hadronic channel. The main background present 
in this data sample is due to the production of multi-jets 
via QCD couplings. 

This analysis relies on Monte Carlo event generation 
and detector simulation to model the tt events. We 
use herwig v6.505 [l8| for the event generation. The 
CDF II detector simulation [l^l reproduces the response 
of the detector to particles produced in pp collisions. 
Tracking of particles through matter is performed with 
GEANT3 (2(|. Charge deposition in the silicon detec- 
tors is calculated using a parametric model tuned to 
the existing data. The drift model for the COT uses 
the Garfield package [2l|, with the default parameters 
tuned to match COT data. The calorimeter simulation 
uses the GFLASH [22j parameterization package interfaced 
with GEANT3. The GFLASH parameters are tuned to test- 
beam data for electrons and pions. We describe the mod- 
eling of the background in Section [V] 

The events passing the trigger selection are further re- 
quired to pass a set of clean-up cuts. First, we require 
the reconstructed primary vertex [23[ in the event to lie 
inside the luminous region Qz\ < 60 cm). In order to 
reduce the contamination of the sample with events from 
the leptonic tt decays, we veto events which have a well 
identified high-p^ electron or muon [24| , and require that 
^ T be < 3 GeV 1 / 2 (25[, where the missing transverse 

energy, fix |26(, is corrected for both the momentum of 
any reconstructed muon and the position of the pp in- 



teraction point. The quantity ^2 Et is the sum of the 
transverse energies of jets. 

After this preselection, we consider events with exactly 
six jets, each with transverse energy Et > 15 GeV and 
1 77 1 < 2. With these six jets, we calculate four vari- 
ables that are used for the kinematic discrimination of 
tt from background. One of these variables is Et de- 
fined above. Another variable, 5Z 3 -Er, is the sum of 
the transverse energies of jets removing the two leading 
jets. We define centrality, C, as , , where 

^2E and ^2pz are the sum of the energies of jets and 
the sum of the momenta of jets along the z-axis, respec- 
tively. The fourth variable is the aplanarity, A, defined 
as |Qi- Here Qi is the smallest normalized eigenvalue 

of the sphericity tensor S a b = Ylj^a^bf wnere P£ is 
the momentum of a jet along one of the Cartesian axes. 
We select events which satisfy the following kinematical 
cuts: A + 0.005^3 E T > 0.96, C > 0.78, and J2 e t > 
280 GeV. More details on the clean-up cuts, kinematical 
and topological variables as well as the optimization of 
the cuts are given in Ref . [13] ■ 

Since the final state of a tt event is expected to contain 
two jets originating from b quarks, their identification is 
important for enhancing the tt content of our final data 
sample. Jets are identified as b jets using a displaced ver- 
tex tagging algorithm. This algorithm looks inside the 
jet for good-quality tracks with hits in both the COT 
and the silicon detector. When a displaced vertex can 
be reconstructed from at least two of those tracks, the 
signed distance {L^d) between this vertex and the pri- 
mary vertex along the jet direction in the plane trans- 
verse to the beams is calculated. The jet is considered 
tagged if L 2 d/o'(L2d) > 7.5, where a{Liu) is the un- 
certainty on Lj2d- This algorithm has an efficiency of 
about 60% for tagging at least one b jet in a simulated 
tt event. More information concerning b tagging is avail- 
able in Ref. [23j]. In order to improve the signal purity, 
we require the existence of at least one secondary vertex 
tag in the event. 

We introduce a new variable, minLK L, defined as the 
minimum of the event probability calculated using the 
matrix element technique (see Section IIVI for details). 
Figure [1] shows the distribution of minLK L for a sim- 
ulated tt sample with M top = 175 GeV/c 2 (continuous 
line) and for the background (dashed line), after kine- 
matical and b tagging requirements. Here and through- 
out this paper we use M top to label the top quark mass 
used in the event generation. The event probability used 
in Figure [T] is not normalized due to omission of multi- 
plicative constants in its calculation. Although techni- 
cally this variable is not a probability, we will keep using 
this name. To further reduce the background contribu- 
tion, we require that minLKL < 10. The value of the 
cut on minLKL is obtained by minimizing the statisti- 
cal uncertainty on the top quark mass reconstructed us- 
ing only the matrix element technique. The optimization 
was done for various top mass quark values using a com- 
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bination of simulated tt events and background events 
(described in Section IV) . 




minLKL (Minimum of Negative Log Event Probability) 

FIG. 1: Distribution of minLKL (minimum of the negative 
log event probability) for simulated ti events with M top ~ 
175 GeV/c 2 (continuous) and for background events (dashed) 
modeled in Section [V] The events pass the kinematical and 
^-tagging requirements. 



IV. MATRIX ELEMENT TOOL 

For each event passing our kinematical and topologi- 
cal requirements, we calculate the corresponding elemen- 
tary cross section assuming ti production followed by the 
all-hadronic decay. In this calculation, we consider the 
momentum 4-vectors of all the observed six jets, but we 
assume them to be massless. The fraction of the total 
ti cross section contributed by an event can be inter- 
preted as a probability density for the given event to be 
part of the ti production. As it is shown in Section [IV Al 
for each event this probability density depends on the top 
quark mass. The mass value that maximizes the event 
probability is used in the top quark mass reconstruction 
technique described in Section IVll A likelihood function 
obtained by combining the probability densities of a set 
of events can also be used to reconstruct the top quark 
mass. We use this technique in subsection II V Bl only to 
validate the matrix element calculation used in the prob- 
ability density determination, and not for the final mass 
reconstruction. 



Table [J shows the observed number of events in the 
multi-jet data sample corresponding to a integrated lu- 
minosity of 943 pb -1 that pass the full event selection. 
The table also shows the expected number of ti events 
(S) based on a sample of simulated ti events assuming 
the theoretical value of 6.7 pb [28[ for the tt production 
cross section. The signal-to-background ratio (S/B) is 
also shown, where the number of background events (B) 
is taken as the difference between the observed number 
of events and the ti expectation (S). Based on the re- 
sults reported in Table [H the minLKL cut improves the 
signal-to-background ratio by a factor of three for the 
sample where only one secondary vertex tag is required, 
and by a factor of two for the sample where at least two 
tags are required. 



TABLE I: Number of observed multi-jet events passing the 
event selection corresponding to an integrated luminosity of 
943 pb -1 . The table also shows the expected number of 
ti events (S) and the corresponding signal-to-background ratio 
(S/B). The number of ti events is based on a sample of simu- 
lated events assuming the theoretical value of 6.7 pb 28] for 
the ti production cross section. The number of background 
events is taken as the difference between the observed number 
of multi-jet events and the ti expectation (S). 



Selection 


Single 


Tag 


Double Tag 




Observed 


S S/B 


Observed S S/B 


Kinematical 


782 


71 1/10 


148 47 1/2 


minLKL < 10 


48 


13 1/3 


24 14 1/1 



A. Definition of the probability density 

For any event defined by a set of six 4-momenta, the 
elementary cross section at a given top quark mass m can 
be computed as if the event were the result of ti produc- 
tion followed by all hadronic decay: 



da{m,j) 



dz a dz b f(z a )f(z b ) 



4E a E b \v a 
(2n) 4 S^(E F -E I )]J 



Vb\ 
6 



\M(m,j)\ 2 



{2n) 3 2Ei 



(1) 



Here, z a {z b ) is the fraction of the proton(antiproton) mo- 
mentum carried by the colliding partons; f(z a ) and f(z b ) 
are the parton distribution functions for protons and for 
antiprotons respectively; v a and v b , and E a and E b rep- 
resent the velocities and, respectively, the energies of the 
colliding partons; j is a generic notation for all six 4- 
momenta in the event assuming perfect parton identi- 
fication and reconstruction; M(m,j) is the matrix ele- 
ment corresponding to ti production and decay in the all 
hadronic channel; Ep(Ej) is a generic notation for the 
4-vector of the final(initial) state. 

If we sum the elementary cross sections of all the 
events passing our event selection (trigger, kinematical 
and topological) without the minLKL requirement, we 
obtain a fraction (e(m)) of the total ti cross section, 
0tot(m): 



a(m) = / da(m,j) = a tot {m)e(m) 



(2) 



The fraction e(m) is equivalent to the event selection ef- 
ficiency for ti events and is determined using samples of 
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simulated tt events. 

For each event, we define the probability density 
P(j\m) as 



P{j\m) 



da(mj) 



<Jtot(m)e(m) ]J i=1 d?j l 



(3) 



The quantity P(J\m) fli=i d^ji is the probability for an 
event defined by the set of six jets (i.e., six 4- momenta) to 
be the result of tt production followed by an all hadronic 
decay for top quark mass m. 

The final state partons from tt decay are observed as 
jets in our detector. Using simulated tt events we calcu- 
late transfer functions, TF(j\p), which represent a prob- 
ability for a parton with momentum p to be observed 
as a jet with momentum j. The transfer functions are 
described in Section HV A 31 

In order to enhance the features of the tt phase space, 
an additional weight, Pt(p), is introduced. As it is shown 
in Section HV A 41 this weight is obtained from the distri- 
bution of the transverse momentum of the tt system in 
simulated tt events. 

We assume that all six jets present in an all-hadronic 
tt event are the result of the hadronization of quarks in 
the final state. There is an ambiguity in assigning the 
jets to the quarks, and therefore all the possible combi- 
nations are considered and averaged. The counting of all 
possible assignments is detailed in Section IIV A 21 The 
full expression of the probability density is given by: 



P{j\m) = E / 



dz a dz b f(z a )f(z b ) 



AE n E h \v n 



combi 

\M{m,p)\ 2 (2n)H^(E F 



n 



d 3 pi 



{2ir) 3 2Ei 



i=l 

E T )TF{]\p)P T (p) 



crtot(m)e(m)N combi 



(4) 



where the sum is performed over all jet-parton combi- 
nations and N com i,i is the total number of possible jet- 
parton assignments. 

The calculation of the matrix element M(m,p) is de- 
tailed in Section [IV A II and the integration performed in 
Eq. 0]is described in Section [TV A 51 



1. Calculation of the matrix element 

Two processes contribute to tt production: gluon- 
gluon fusion and quark-antiquark annihilation. At the 
Tevatron, about (15 ± 5)% of tt events are expected to 
be produced by gluon-gluon fusion while the remaining 
85% are produced by quark-antiquark annihilation [28| . 
In addition, 90% of the simulated tt events produced by 
quark-antiquark annihilation result from uu annihilation. 
Given that having both types of tt production doubles the 
calculation time, we only use the matrix element describ- 
ing the process uu — >tt — > (bud)(bud). To validate this 
choice, we reconstruct the top quark mass using a uu ma- 
trix element in a sample of tt events produced only via 



gluon-gluon fusion. We observe a negligible bias (0.0 ± 
0.2 GeV/c 2 ) in the reconstruction of the top quark mass 
and we conclude that using a matrix element with uu as 
the initial state should be sufficient for the mass recon- 
struction. 

For the final state, having a W boson decay into a 
ud pair or a cs pair results in no difference for the final 
reconstruction as both pairs of quarks will be observed 
as jets. The other hadronic decays are suppressed since 
their rate is proportional to the square of small elements 
of the Cabibbo-Kobayashi-Maskawa matrix (2j| . 

In the high-energy limit (or the massless limit), the 
solutions to the Dirac equation can be written as 



(5) 



where p = (E p ,p) is the 4-momentum of a particle. 
The solution with positive frequencies is u(p), and that 
with negative frequencies is v(p); er M = (1,<t) and = 
(1, — a), where a are the Pauli spin matrices. 

The presence of the operator p ■ a will project the spin 
states along the direction of motion defined by p. For 
a particle traveling in the direction defined by the polar 
angle 9 and by the azimuthal angle <f>, the spin states 
along this direction are given by Eq. [SJ 




£(T) 



cos ~ 
sin ■ 



(G) 



For an antiparticle we have rj(1) = £(J,) and rj(l) = 
—£(!)■ Given these relations and assuming that the in- 
coming partons travel along the z-axis, the matrix ele- 
ment has only two non-zero terms due to the initial state 
partons, Irr and III- These are 4- vectors and corre- 
spond to the situations when the incoming partons have 
the same chirality. Considering the proton going in the 
positive direction, these terms are: 



RR 



j2EfJ2EB (0,1,1,0) 



I1 L = y/2q?J2Ejp (0,1, -i,0) 



(7) 



where E™ and E^ 1 are the energies of the incident u 
quark and u quark, respectively. 

Omitting all multiplicative constants, we express the 
matrix element squared as 

\M\ 2 ^ £ \M\ 2 = Fl-P g -P t -P r P Wl 

spins 
colors 

xPw 2 -(\M R r\ 2 + \M ll \ 2 ) (8) 
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where the factors entering this expression are 

' F E = ^(E b ){E^(E u ){E^{E d ){E^)(E^){E^) 

P - - i 

< r t.t — (p^_-m 2 ) 2 +m 2 r^ 

p _ ' 1 

(9) 

In Eq. [9] and Tw are the mass and the width of the 
W boson, m and T t are the mass and the width of the 
top quark, and W+ and W- are the 4-momenta of the 
W bosons. 

Also in Eq. [5] Mj stands for both Mrr and Mll 
(Eq. [5]) , the difference arising from the definition of the 
symbol 5/. The symbol Si is defined as 

5/ = vtl^lvtffip + m 2 I^ (10) 

where 7 M are the Dirac matrices and I is either Irr or 
III- We calculate Mrr and Mll using Eq.[B]and matrix 
algebra [30j . 

12. Combinatorics 

While there are 6! = 720 ways to assign the observed 
jets to the six partons of the final state in all hadronic 
it decay, we can take into account a reduced number of 
possibilities by making a few observations and assump- 
tions. 

In the case of the process uu — ► it , assuming that 
the masses of the up quarks are negligible and omitting 
the constant and the gluon propagator terms, the spin 
averaged matrix element squared is 

\ l-^' 2 ~ (Pu-Pt){Pu-Pt) + {Pu-Pt)(Pu-Pt) + m(Pu-Pu) 

spins 

(ii) 

where p is the 4-momentum of a particle. 

From this expression, the t t symmetry is evident. 
The symmetry holds also for the matrix element of the 
process containing the decay of the top quarks. This sym- 
metry can be translated into a symmetry to b <-> b once 
we consider all possible b-W pairings for each top quark: 

{t=(b 1 ,W 1 ),i=(b 2 ,W2)},{t=(h,W 2 ),t=(b2,Wl)}. 

It is obvious that swapping the 6's is equivalent with 
swapping the top quarks. 

In conclusion, due to the t <-» t symmetry the number 
of relevant combinations is 360. Secondly, if any of the 
jets is identified as a secondary vertex tag we assume 
that jet be produced by a b quark. This assumption 
results in a factor of three reduction of the number of 
relevant combinations, down to 120 (or 5!). If there is 
an additional secondary vertex tag, we get a factor of 
five reduction down to 24 (or 4!). If there are more than 



two secondary vertex tags, we assign to 6 quarks only 
the two jets with the highest transverse energy. Note 
that the quarks in the decay of either W boson can not 
be interchanged in the matrix element calculation as one 
is particle and the other is antiparticle and they have 
different spinors. 



3. Transfer functions 

The transfer functions, TF(j\p), express the probabil- 
ity for a parton with momentum p to be associated with 
a jet reconstructed to have momentum j. The transfer 
function term from Eq.[4]is in fact a product of six terms, 
one for each of the final state quarks: two for the b quarks 
and four for the decay products of the W bosons. For 
each jet in the final state we assume that the jet axis is 
the same as that of the parton that went on to form the 
jet. Making the change of variables j — > £ = 1 — j/p, the 
expression for TF(j\p) becomes: 

6 6 

TF(j\p) -> JJT^IPJ) =HTF(C(ji)\pi) 

i=i i=i 

X H0 tfvaij. -sip.) (12) 
Pi 

where ftj i and flp i are the solid angles of the jets 
and of the quarks, respectively. The transfer functions 
TF(Q\pi) are built using simulated it events with M top = 
175 GeV/c 2 surviving the trigger, kinematical and topo- 
logical requirements. The choice of M top — 175 GeV/c 2 
is arbitrary as our studies show that the transfer func- 
tions have a negligible dependence on the mass of the top 
quark in the range 150 GeV/c 2 to 200 GeV/c 2 . In this 
sample, we associate a jet with a parton if their separa- 
tion in the r\ — (j> space is AR — \J Ar\ 2 + A(j) 2 < 0.4. We 
define a jet to be matched to a parton if no other jet sat- 
isfies this geometrical requirement. We define a it event 
to be matched if each of the six partons in the final state 
has a unique jet matched to it. The transfer functions 
are built out of the sample of matched events. 

The jets formed by partons from W^-bosons decays have 
a different energy spectrum from that of the jets origi- 
nating from the b quarks. Thus we form different sets of 
transfer functions depending on the flavor of the parton 
the jet has been matched to. 

The transfer functions are described using a parame- 
terization in bins of the parton energies and of the parton 
pseudorapidities. We use three bins for the pseudorapid- 
ity: 0.7, 0.7 «-> 1.3, and 1.3 <-> 2.0. Table M shows 
the definition of energy binning for the 6-jet transfer func- 
tions, while Table Hill is for the W-jet transfer functions. 
The energy binning is chosen such that the distributions 
for transfer functions are smooth. In each bin, the shape 
of the transfer function is fitted to a normalized sum of 
two Gaussians. 



10 



TABLE II: Definition of the binning of the parton energy 
for the 6-jet transfer functions parameterization for various n 
bins. The unit for the energy values is GeV. 



Bin 


o< N 


< 0.7 


0.7 < |q 


< 1.3 


1.3 < 


\r)\ < 2 -0 


1 


10 - 


* 53 


10 -> 


83 


10 


— > oo 


2 


53 - 


» 64 


83 -> 


111 






3 


64- 


» 74 


111 - 


> oo 






4 


74- 


» 85 










5 


85 - 


> 97 










G 


97^ 


114 










7 


114- 


-> oo 











TABLE III: Definition of the binning of the parton energy for 
the IF -jet transfer functions parameterization for various r\ 
bins. The unit for the energy values is GeV. 



Bin 


o< N 


< 0.7 


0.7 < |77 


<1 


1 


10 - 


* 32 


10 -> 


50 


2 


32 - 


» 38 


50^ 


63 


3 


38 - 


* 44 


63 -> 


76 


4 


44- 


» 49 


76^ 


90 


5 


49 - 


» 54 


90 -► 


108 


6 


54- 


> 59 


108- 


* oo 


7 


59 - 


» 64 






8 


64- 


> 69 






9 


69 - 


> 75 






10 


75 - 


* 81 






11 


81 - 


* 89 






12 


89 - 


» 99 






13 


99 -> 


113 






14 


113- 


-> oo 







1.3 < M < 2.0 



10 

98 



98 

oo 



4- Transverse momentum of the tt system 

The Pt (p) weight (introduced in Eq. 2]) is a function 
dependent on the momenta of the partons in the final 
state, generically represented by p in the argument of 
the function. More exactly, this weight depends on the 
magnitude of the transverse momentum of the tt system, 
p", and azimuthal angle, 0". As we expect to have a 
flat dependence on 0" we express this through a factor 
of 1/2tt. We define the function Px(Pt) to express the 
dependence on . We write in Eq. [T3] the expression 
of the weight due to the transverse momentum of the 
tt system. 



Pt (p t S=\(P t I) 



(<) 2 



2tt 



yW) 2 + (pf) 2 



(13) 

where ptp is shown in its Cartesian form using the projec- 
tions of the transverse momentum of the tt system along 
the x and y axes. 



Using the same simulated tt events as for transfer func- 
tions, in Fig. [2] we show the distribution of the magnitude 
of the transverse momentum of the tt system. A sum 
of three Gaussians is a good fit of this distribution. 




res reo 
P T of tt (GeV/c) 



FIG. 2: Magnitude of the transverse momentum of the tt sys- 
tem in simulated ti events. The fit is a sum of three Gaussians. 



5. Implementation and evaluation of the probability density 



The Sections IIVA21 IIVA11 IIVA3I and IIV A 41 

present details on the expressions of several important 
pieces entering the probability density. To carry out the 
integration over parton momenta, we change to a spher- 
ical coordinate system. The delta functions 5 1 - 2 ' (Oj 4 — 
f2p 4 ) present in the expression of the transfer functions 
TF(j\p)(Eq. [T2")) allow us to drop all integrals over the 
parton angles. 

To further reduce the number of integrals we use the 
narrow width approximation for the W bosons. This 
results in two more delta functions for the squares of the 
propagators of the two W bosons exemplified in Eq. [14] 
for both bosons. 



Pw = 



(P? 



W 



M 2 F 2 



<5(P, 



M W T 



w 



(14) 



In the high-energy limit, the invariant mass of the W + - 
boson decay products is given by: 

= 2pip 2 sin 9i sin 2 (cosh A7712 - cos Ad) 12 ) = 

= 2pip 2 wi2(ni,0 2 )(15) 

where A7712 is the difference in pseudorapidities of the 
two decay partons and A0i 2 = tt — \ \cj)i — (f> 2 \ — 7r| is the 
difference between their azimuthal angles. 

Making the change of variables P^ + — > pi , Eq. [14] can 
be written as: 



Pi 



w+ 



1 



M W T W 2^2^12(^1,^2) 



*(pi-p?) (16) 
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where p® — Myy/ (2^2^12)- In the case of the W~ bo- 
son we use equations similar to Eqs. 1151 and ll6l but with 
different notations: the change of variables is P%y- ~> P3 
and the pole of the delta function is p^ = M^/ (2^41^34). 
The mass and width of the W boson are fixed at 
80.4 GeV/c 2 and 2.1 GeV/c 2 , respectively [p. 



As described in section HV A 11 we assume that the in- 
coming partons have zero transverse momentum. This 
would, in principle, result in violation of momentum con- 
servation in the transverse plane as we consider non-zero 
transverse momentum for the ti system in the ME cal- 
culation. However, we expect this to be a small effect 
covered by the uncertainty on the parton distribution 
functions of the proton and of the antiproton. We can 
omit the delta functions requiring energy conservation 
along the x and y axes, resulting in 

6 

5^(E F - Ej) -> 5(E a + E b - Y,Pi) 

i=l 

6 



i=l 



6 6 
i=l i=l 



(17) 



We make the change of variables z a — > p u and Zb 



Pu since z a 



Pu/p : 



'proton 



and z b = Pu/p, 



'antiproton • 



The 



values of the proton and antiproton momenta, p pro ton 
and Pantiproton, are constant and from now on we drop 
them from any expressions. In the high-energy limit we 
have \v a — Vb\ = 2c and we omit this term since c is 
a constant, the speed of light. We express the energy- 
conserving delta function as 



Pb- 



Px = Pb cos 4>b sin Ob+Pb cos ^ sin % 
Pb sin 4> b sin b + Pi sin 4% sin % - 



P 



. (20) 

We write the expression of the probability density in 
its final form as 



P(j\m) = E / 



dp t *dp t y t dp 2 dp 4 



combi 1 



crtot{m)e{m)N combl 



, J(b 6) P6Pg /(p°)/(p°) 

(W12) 2 (W 34 ) 2 P2P4 



n 



TF(Ci\ Pi ) 



Pt 



P g ■ P t ■ P T ■ (\M RR \ 2 + \M LL \ 2 ) (21) 



We evaluate the integrals in Eq. [21] numerically. The in- 
tegration is performed in the interval [—60, 60] GeV/c for 
the variables and [10,300] GeV/c for the variables 
P2,4- The step of integration is 2 GeV/c. Based on a 
sample of ti events where M top = 175 GeV/c 2 passing 
the event selection, we choose these integration ranges 
such that the distributions of the parton level variables 
(Px y and momenta of VF-boson decay partons) are con- 
tained well (99%) within them. Given these limits, at 
each step of integration we have to make sure that all 
momenta entering Eq. (2D have positive magnitudes. The 
probability density is evaluated for top mass values going 
in 1 GeV/c 2 increments from 125 GeV/c 2 to 225 GeV/c 2 . 

The dependence on mass of the ti cross section is ob- 
tained from values calculated at leading-order by COM- 
PHEP [32j Monte Carlo generator for the processes uu — > 
it, dd — > it and gg — > ti. The absolute values for these 
cross sections are not as important as their top mass de- 
pendence, which is shown in Fig. [3J 



6^(E F -E I )^5( Pu +p T[ -J2p i ) 

i=l 

6 

xS(p u -Pu~ ^PiCOsOi) = 

1=1 

= \5{Pu-pI)5{p^~pI) (18) 

where p° u = ELiM 1 + cos 6^/2 and p^ = EL1M 1 ~ 
cos 9i)/2. 



In section IIV A 41 we expressed Pr{p) as a function 
of the projections of the transverse momentum of the 
it system along the x and y axes (Eq. [T3"|) . We will make 
a change of variable from the 6-quark momenta to these 
variables. The Jacobian of this transformation 



J(b -> 6) 



1 



sin 8b sin 6>j{cos <pb sin cj>r — sin <pb cos cf>r) 



is obtained by solving the system of equations for pb and 
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FIG. 3: Cross section for tt production as a function of the 
top quark mass, as obtained from COMPHEP [32I ]. 

For the proton and antiproton parton distribution 
functions (PDF), /(p°)/(p£), we use the CTEQ5L [H 
distributions with the scale corresponding to a top mass 
of 175 GeV/c 2 . The ti acceptance, e(m), is described in 
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Section HVH 



B. Validation of the matrix element calculation 



The event probability described in the Section IIV Al is 
expected to have a maximum around the true top quark 
mass in the event. Multiplying all the event probabilities 
we obtain a likelihood function, 



L(M top )= J] P(i\Mto P ) 



(22) 



which is expected to have a maximum around the true 
top quark mass of the sample. Finding the value of the 
top quark mass that maximizes the likelihood represents 
the traditional method for reconstructing the top quark 
mass using a matrix element technique [4J. However, 
we use this reconstruction technique only to check the 
matrix element calculation. 

We use the simulated tt samples generated with vari- 
ous top quark masses. For each sample, we reconstruct 
the top quark mass using the traditional matrix element 
technique and compare the reconstructed mass to the 
true input mass M top for several different input mass val- 
ues. Ideally, we should see a linear dependence with no 
bias and a unit slope. 

The first check is done at the parton level. We smear 
the energies of the final state partons from our simu- 
lated tt events and use these numbers to describe the 
jets. The parton energies are smeared according to the 
transfer functions described in section IIV A 31 Figure 2] 
shows the linearity check in this case. We observe a slope 
of rsI and a bias of 0.9 GeV/c 2 . 

We perform the same test using the energies of the 
jets matched to the partons. Figure O shows the linearity 
check. Here the bias is 1.2 GeV/c 2 , but the slope remains 
«1. The final test we perform to validate the matrix 
element calculation uses fully reconstructed signal events 
where we allow events to include mismatched jets as well. 
Figure [5] shows the linearity check in this case. The bias 
is no longer the same for all masses as the slope is 0.94 
± 0.01. 

Although there is some bias, all checks we list above 
show the good performance of our matrix element cal- 
culation. In general, the traditional matrix element ap- 
proach [3j is expected to provide a better statistical un- 
certainty on the top mass than the template analyses [j| . 
In the case of the present analysis, our studies show that 
the traditional matrix element method does better only 
when the mass reconstruction is performed on signal sam- 
ples. When the background is mixed in, the template 
method we use has a greater sensitivity and by construc- 
tion eliminates the bias of the matrix element calculation 
(see Section IVlf . 
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FIG. 4: Reconstructed top mass versus input top mass at 
parton level. The energies of the partons have been smeared 
using the transfer functions. The continuous line y = x is 
added for visual reference. 
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FIG. 5: Reconstructed top mass versus input top mass using 
jets that were uniquely matched to partons. The continuous 
line y = x is added for visual reference. 
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FIG. 6: Reconstructed top mass versus input top mass using 
realistic jets. The continuous line y — x is added for visual 
reference. 
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V. BACKGROUND MODEL 

In this section we describe the data-driven technique 
used to model the background for this analysis. The 
technique uses jet energies which are measured in the 
calorimeter and so are unchanged by jet energy scale 
changes. Properties of the model arc checked by com- 
parison with a simulated sample of events containing the 
final state 66 + 4 light partons. 

The modeling of background is based on a subset of 
the multi-jet data sample depleted of tt events where the 
heavy flavor jets are identified according to background- 
like heavy flavor rates (tagging matrix) , described in Sec- 
tion IV Al The subset of multi-jet data is selected ap- 
plying the event selection of Section IIIII excluding the 
minLKL and the secondary vertex tag requirements. 
This sample (BG) counts 2652 events, with an estimated 
signal-to-background ratio of about 1/25. For this ra- 
tio we estimate the signal from a sample of simulated 

events assuming a tt production cross section of 6.7 pb. 
The estimate for the background is equal to the number 
of observed events in the BG sample. 



A. Tagging matrix 

The tagging matrix is a parameterization of the heavy 
flavor rates as a function of the transverse energy of jets, 
the number of tracks associated to the jet and the number 
of primary vertices in the event. Using the 6-tagging al- 
gorithm described in Section ITTTl wc determine the above 
rates in a sample (4J) largely dominated by QCD multi- 
jet processes and selected from multi-jet data events with 
exactly four jets and passing the clean-up requirements 
described in Section HTT1 

We use a control region to check our assumption that 
the tagging rates from the 4-jet sample can be used to 
predict the tagging rates as a function of the variables 
used in the kinematical selection. This control region 
(CR1) contains events with exactly six jets and passing 
the clean-up cuts. The signal-to-background ratio in this 
region is about 1/250, estimated using same method as 
for the BG sample. We compare the observed rates with 
the predicted rates based on the tagging matrix. Fig- 
ure [7] shows the comparison for events with exactly one 
secondary vertex tag, while Fig. [5] shows the comparison 
in the sample with at least two secondary vertex tags. 
The variables chosen for this comparison are the trans- 
verse energies of jets, sum of the transverse energies of 
the six leading jets, aplanarity, and centrality as defined 
in Section HTTl The Kolmogorov-Smirnov probabilities for 
these comparisons in the single (double) tagged samples 
are: 0.0 (8.6E-5), 3E-11 (0.69), 0.99 (4.3E-3), and 0.12 
(0.05), respectively. 

Based on Fig. [7(a), the discrepancy between the ob- 
served rate and the predicted rate for jets with low trans- 
verse momentum may be an artifact of the binning of the 
tagging matrix. For transverse energies between 15 GeV 



and 40 GeV the tagging matrix uses the average rate, 
and therefore the rates for smaller intervals in this range 
might not be predicted well. Figures [7(a) and[5](a) sup- 
port this by showing that, for this range of transverse 
energies, half of the data points are below and the other 
half is above the solid histogram representing our back- 
ground model. 

The overall agreement between the observed and pre- 
dicted rates is quite poor. In principle, a systematic un- 
certainty should be assigned to cover this discrepancies. 
However, the templates used in the mass measurement 
use the event probability based on matrix element infor- 
mation and they will be less affected by these inaccu- 
racies. The reason for this is the fact that we use only 
a tt matrix element. For background events the event 
probability (Eq. [2Tj) is flat as a function of the assumed 
top quark mass. The flatness of the event probability 
results in wide templates for the background sample and 
the systematic effects due to the mistag matrix will get 
smeared. In fact, the background templates in the con- 
trol regions defined in Section IV Bl agree very well with 
the corresponding distributions based on the simulation 
of background events with 66 + 4 light partons in the 
final state. 

We conclude that the tagging matrix can be used to 
predict the background-like heavy flavor rates for events 
with same jet multiplicity as expected for the all-hadronic 
tt events. More details on the tagging matrix can be 
found in Ref. [27[ . 




Aplanarity (A) Centrality (C) 



FIG. 7: Background validation in control region CR1 for sin- 
gle tagged events from the multi-jet data (dots) and from the 
background model (solid histogram). The distributions are 
normalized to the same area. 



B. Estimation of the background 

Based on the tagging matrix, a jet has a certain prob- 
ability (rate) to be tagged as a heavy flavor jet depend- 
ing on its transverse energy, number of tracks associated 
to it and number of vertices in the event. For a jet 
with a transverse energy between 15 GeV and 40 GeV 
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FIG. 8: Background validation in control region CR1 for dou- 
ble tagged events from the multi-jet data (dots) and from the 
background model (solid histogram). The distributions are 
normalized to the same area. 



contains events that pass all our selection requirements 
without the minLKL cut and has a signal-to-background 
ratio of about 1/6. The signal region SR has events pass- 
ing all selection criteria defined in Section IIIIl Table IIVI 
summarizes all the regions used in our background mod- 
eling procedure. 



TABLE IV: Definition of the control regions used in the back- 
ground modeling procedure. The selection requirements that 
differentiate them are defined in Section ITTT1 

Region Clean-up Nj ets Kinem. minLKL 6-tag N events 



4J 


yes 


4 


no 


no 


no 


2,242,512 


BG 


yes 


6 


yes 


no 


no 


2652 


CR1 


yes 


6 


no 


no 


no 


380,676 


CR2 


yes 


6 


yes 


no 


yes 


930 


SR 


yes 


6 


yes 


yes 


yes 


72 



and with ten associated tracks, this probability is (7.2 
± 0.5)%. Using these probabilities we tag the jets as 
originating from a 6 quark. This tagging procedure is 
repeated 20,000 times in the events of the BG sample 
producing about ten million tagged configurations which 
are interpreted as background events. 

A tagged configuration is an event from the BG sam- 
ple where at least one of the six jets is tagged using the 
tagging matrix. Such kind of event can produce many 
tagged configurations which are unique if they have dif- 
ferent tagged jets or a different number of tagged jets. We 
find 12888 unique single tagged configurations, and 26715 
unique double tagged configurations. Of these, 657 (or sa 
(5.1 ± 0.2)%) single tagged configurations and 1180 (or 
w (4.4 ± 0.1)%) double tagged configurations pass the 
minLKL cut. We use these configurations, unique or 
duplicate, to form all relevant background distributions 
used for various checks and for the final measurement. 

The estimated number of background events is defined 
as the difference between the total number of events ob- 
served in the data sample and the expected number of 
ti events based on the standard model expectation for 
ti production cross section of 6.7 pb [28]. This normal- 
ization applies to the top quark mass reconstruction pro- 
cedure described in Section IVI1 and for the validation of 
the background model described below. 

We check various distributions of the background 
events modeled above against those from a sample of 
simulated events with 66 + 4 light partons in the final 
state. This simulated sample is built using ALPGEN [35j 
for the event generation, pythia for the parton show- 
ering, and the detector simulation as described in Sec- 
tion IIIIl Given our event selection, other background 
sources are expected to have smaller contributions com- 
pared to the one from 66 + 4 light partons and therefore 
affect less the relevant distributions. 

This check is performed in a control region (CR2) and 
in the signal region (SR) defined as follows. Region CR2 



Given that the BG sample used in our background 
model contains a small ti content, we need to correct 
all the background distributions built from it. The rela- 
tionship between a given uncorrected background distri- 
bution, fg, and the corrected one, f B orr is 

Jb - asfs 



JE 



I - as 



(23) 



where fs is the corresponding distribution for ti events 
and as is the fraction of the uncorrected background sam- 
ple due to it events. These quantities for ti are deter- 
mined from a sample of simulated ti events where Mt op 
= 170 GeV/c 2 by randomly tagging the jets using the 
tagging matrix defined in Section IV Al We choose the 
above value for the top quar k mass based on the value of 
the world mass average (34J at the time of this analysis; 
in Section I Villi we determine a systematic uncertainty 
due to this choice. The expression for as in region CR2 
is 



,CR2 



N CR2 



B CR2 + N CR2 



(24) 



where B CR2 is the background estimate in this region 
and Ng R2 is the number of ti events estimated using the 
tagging matrix. The expression for as in region SR is 



N CR2 e m, 



iLKL 



j^CR2 ( minLKL 



N 



CR2, minLKL 



(25) 



where 



^minLKL { -minLK L \ 



is the efficiency of the 



minLKL cut for ti (background) in the CR2 region. The 
efficiency for background is determined using the ratio of 
the number of uniquely tagged configurations before the 
minLKL cut (12,888 single tagged and 26,715 double 
tagged), and after the minLKL cut respectively (657 sin- 
gle tagged and 1180 double tagged). Table W\ shows the 
estimated number of background events B CR2 and the 
efficiency of the minLKL cut for background e r B " nLKL 
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in region CR2. Tables IVII and I VIII show the values for 
Ng R2 , and a9 R2 in region CR2 as well as the 



iLKL 



values of a° s K for simulated tt samples with different val- 
ues on M top . 



TABLE V: The estimated number of background events 
B CR2 and the efficiency of the minLKL cut for background 
€ ™ mLKL i n region CR2. The number of background events is 
the difference between the observed number of events and the 
expected number of tt events assuming a tt production cross 
section of 6.7 pb. 



Parameter 


Single Tag 


Double Tag 




711 


101 


^minLK L 


0.051 


0.044 



TABLE VI: The number of tt events, Ng R2 , with one jet iden- 
tified as 6 jets using the tagging matrix; in region CR2, the 
acceptance of the minLKL cut for tt events, e ^ tnLKL ) an d 
the values of the parameters of^ 2 (Eq.[24j), and ag R (Ea. l25|) 
for simulated tt samples with different values on Mtop- 



M top (GeV/c 2 ) 




e s 


n cm 

s 


4 H 


160 


29 


0.21 


0.039 


0.146 


170 


30 


0.20 


0.040 


0.144 


175 


28 


0.19 


0.038 


0.130 


180 


28 


0.18 


0.038 


0.124 



TABLE VII: The number of tt events, Ng R2 , with at least 
two jets identified as b jets using the tagging matrix; in re- 
gion CR2, the acceptance of the minLKL cut for tt events, 
e minLKL^ and thg values of the parame ters a% R2 (Eq.[24]|, and 
a§ R (Eq. I25p for simulated tt samples with different values on 



top • 



M top (GeV/c 2 ) 




e s 


n am 

s 


4 H 


160 


2 


0.31 


0.019 


0.133 


170 


2 


0.29 


0.019 


0.126 


175 


2 


0.29 


0.019 


0.126 


180 


2 


0.27 


0.019 


0.118 



The correction procedure uses by default the parame- 
ters as derived for M top = 170 GeV/c 2 . In the determina- 
tion of the systematic uncertainty due to this choice, we 
use the parameters corresponding to M top — 160 GeV/c 2 , 
and M top = 180 GeV/c 2 , respectively (see Section IVIIID . 
The parameters obtained using M top = 175 GeV/c 2 are 
given for reference in Table IVII as that mass value corre- 
sponds to a tt production cross section of 6.7 pb. 

Following this correction procedure, we compare 
shapes between our background model and the sample of 
simulated 66 + 4 light partons described above. First, we 
do this comparison in region CR2 where we look at the in- 
variant mass of all the untagged pairs of jets in the event 



(Fig. [9]). The values of the Kolmogorov-Smirnov proba- 
bilities are 25% for the samples with single tagged events, 
and 43% for the samples with double tagged events. For 
the signal region, we look at the invariant mass of all the 
untagged pairs of jets in the event (Fig. [T0|) and at the 
most probable per-event top quark mass (Fig. ITT)) . These 
are variables of particular interest in this region as they 
will be used in the reconstruction of the top quark mass 
and for the in situ calibration of the jet energy scale, 
as described in Section [VjJ Based on the comparison 
from Fig. QJjJ the Kolmogorov-Smirnov probabilities are 
90% for the single tagged events and 70% for the double 
tagged events. 



(a) 
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Dijet Mass (GeV/c ) 

FIG. 9: Invariant mass of pairs of untagged jets in control 
region CR2 for ALPGEN bb + 4 light partons (cross), and for 
the background model (solid): (a) for single tagged events 
(Kolmogorov-Smirnov probability is 25%) and (b) for double 
tagged events (Kolmogorov-Smirnov probability is 43%). 



These comparisons show good agreement between our 
data-driven background model and a simulated sample 
of events containing the final state 66 + 4 light partons, 
obtained using the ALPGEN generator. In SectionfVlITJwc 
evaluate the effect on the reconstructed top quark mass 
due to the limited statistics available in sample BG to 
construct the background model. 



16 



Background with 1 b-tag 
I | Model 
Monte Carlo 




Dijet Mass (GeV/c ) 



(b) 



Background with 2 b-tags 
I | Model 
Monte Carlo 




Dijet Mass (GeV/c ) 



(a) 



0.3 +~ 

> 

O 0.25- 

c : 

■S 0.2 - 
o 

ra ; 

£ 0.15^ 



0.1- 



0.05- 



0.3 + 
0.25 + 

02 r 
0.15 + 

0.1- 
0.05- 



Background with 1 b-tag 
| | Model 
Monte Carlo 



140 



160 



180 



200 



220 

2. 



Event Top Mass (GeV/c ) 



(b) 



Background with 2 b-tags 
I | Model 
Monte Carlo 



140 



160 



180 



200 



220 

2. 



Event Top Mass (GeV/c ) 



FIG. 10: Invariant mass of pairs of untagged jets in sig- 
nal region for ALPGEN 66 + 4 light partons (cross), and for 
the background model (solid): (a) for single tagged events 
(Kolmogorov-Smirnov probability is 90%), and (b) for double 
tagged events (Kolmogorov-Smirnov probability is 70%). 



FIG. 11: Event by event most probable top quark masses in 
the signal region for ALPGEN 66 + 4 light partons (cross), and 
for the background model (solid): (a) for single tagged events, 
and (b) for double tagged events. 



VI. TOP QUARK MASS ESTIMATION 

Our technique starts by modeling the data using a mix- 
ture of signal events obtained from ti simulation and of 
background events obtained via our background model. 
The events are represented by two variables: the invari- 
ant mass of pairs of untagged jets and an event-by-event 
reconstructed top mass described below. These two vari- 
ables are used to form distributions (templates), sepa- 
rately for ti events and for background events. In the 
case of tt events, the templates are parameterized as a 
function of the mass of the top quark and the jet energy 
scale (JES) variable (defined below). For background no 
such dependences are expected since they contain no top 
quark and the jet energies used for the background mod- 
eling are taken from data. The measured values for the 
top quark mass and for the JES are determined using a 
likelihood technique described in Section fVI Bl 

The largest contribution to the systematic uncertainty 
on the top quark mass is due to the uncertainty on the jet 
energy scale. To limit the impact of this systematic on 
the total uncertainty on the top quark mass, we use an in 
situ calibration of the jet energy scale via the I'F-boson 



mass. We measure a parameter JES that represents a 
shift in the jet energy scale from our default calibration 
as defined in Section IIIII This quantity is expressed in 
units of the total nominal jet energy scale uncertainty 
a c that is derived following the default calibration. This 
uncertainty depends on the transverse energy, pseudora- 
pidity, and the electromagnetic fraction of the jet energy. 
On average, the uncertainty is approximately equivalent 
to a 3% change in the jet energy scale for jets in tt events. 
By definition, JES = a c represents our default jet en- 
ergy scale; JES = 1 cr c corresponds to a shift in all jet 
energies by one standard deviation; and so on. 

The templates for ti events are determined from sam- 
ples of simulated ti events with M top ranging from 
150 GeV/c 2 to 200 GeV/c 2 in steps of 5 GeV/c 2 . We 
also include the sample where M top = 178 GeV/c 2 for a 
total of twelve different tt simulated mass samples. In 
addition to the variation of the top quark mass, for each 
value of M top we consider seven values for JES between 
-3 <7 C and 3 <r c , in steps of 1 a c . We use the events ob- 
tained from our background model to form the templates 
for the background. 
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A. Definition and parameterization of the 
templates 

The first set of templates, called the top templates, 
is built using a variable (m t ° v p t ) determined using the 



matrix element technique. We call 



top 



the event-by- 



event reconstructed top quark mass, and it represents 
the mass value that maximizes the event probability de- 
fined in Section [W] We find the value of rn° v p t by evalu- 
ating the event probability in the range 125 GeV/c 2 — > 
225 GeV/c 2 . When building the templates, we drop the 
events for which the event probability is naturally max- 
imized at mass values outside this range. These events 
accumulate at the edges of the distribution making dim- 
cult the parameterization described below. 

For it events, the function '(m*°?|M top , JES) used 
to describe the shape of these templates is a normalized 
product of a Breit-Wigner function and an exponential: 



Pt°*{m t Z\M top ,JES) = 



o cxp (-(m*°t - 01)03) 
N{M top , JES) 
02/277 
X (m t e Z-a l y + al/4 



(26) 



where the parameters on depend on M top and on JES. 
The normalization is set by N(M top , JES) that has the 
following expression: 



N(M top , JES) = Y,(P3k + Psk+i ■ JES 

k=0 

+p 3k+2 ■ JES 2 ) ■ (M top ) k (27) 

The parameters a* (Eq. [26]) depend on M top and JES as 
follows: 



P15 

P3i+13 + P3i+li ■ M top 



i = 

P3i+i5 ■ JES i = 1,2,3 
(28) 

In Eqs. [27] and [28] the parameters pi are constants de- 
termined from the simultaneous fit of the top tem- 
plates from all 84 ti samples with the function 
Pl°P{m c Z\Mto P ,JES). Figure HH shows the function 
M top , JES) for JES = a c and various val- 
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FIG. 12: The function fitting the top templates for ti events 
at nominal JES and for various hypotheses of the top quark 
mass in the case of events with one tagged jet. A similar pa- 
rameterization is obtained for events with at least two tagged 
jets. 



the numerator are over the templates built from sim- 
ulated ti events for a given M top (12 values) and JES 
(7 values). The third sum is over all the bins with 
more than 5 entries from each template. We obtain 
X 2 / N dof = 1554/1384 = 1.12 for the sample with one sec- 
ondary vertex tag and x 2 /N do f = 1469/1140 = 1.29 for 
the sample with two secondary vertex tags correspond- 
ing to very small x 2 probabilities. From the values of the 
quantity x 2 /Njof, we conclude that the parameterization 
of the top templates is not very accurate, and we expect 
some bias in the reconstruction of mass and JES. The 
procedure for bias removal is described in Section [VI CI 

The top templates for background events are built us- 
ing the matrix element in the same way as for ti events. 
The shape of the background template is fitted to a nor- 
malized Gaussian. Figure Q2] shows separately the re- 
sulting parameterized curves of background templates for 
single and double tagged background events. 

The second set of templates, the dijet mass templates, 
are formed by considering the invariant mass tn^ vt of all 
possible pairs of untagged jets in the sample. This vari- 
able is correlated to the mass of the W boson, and plays 
a central role in the in situ calibration of the jet energy 



similar parameterization is obtained for events with at 
least two tagged jets. 

To determine how well the parameterization in Eq. [55] 
describes the templates, we calculate the x 2 divided by 
the number of degrees of freedom, Ndof, as follows 



used to fit the dijet mass templates is a normalized sum 
of two Gaussians and a Gamma function: 



pW(m% t \M top ,JES) = -— 



1 



X 2 /N dof = 



E12 ^7 tr^Nbins ( 
rn — 1 Z-/j — 1 \ 



(29) 



/3 6 /3 7 exp ( - p 7 (m evt W - /3 8 )) 



r(i + /3 9 ) 



N' {Mtop, JES) 
■ (mevtW - fa)? 9 



where hun is the bin content of the template histogram 
and f i n is the value of the function from Eq. [55] at 
the center of the bin. In Eq. [511 the first two sums in 



/5 2 \/27T 
03 

5 V2ir 



exp 



exp 



;so) 
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(a) 

Background Top Template (1 b-tag) 
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FIG. 13: Top templates for (a) single tagged background 
events and for (b) double tagged background events. 



where the parameters (3i depend on M top and on JES 



The normalization is set by N'(M t0 p, 
following expression: 



JES) that has the 
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FIG. 14: The function fitting the dijet mass templates for 
ti events with M top = 170 GeV/c 2 and various values of JES 
in the case of events with one tagged jet. A similar parame- 
terization is obtained for events with at least two tagged jets. 



From the values of the quantity x 2 /Ndof we reach the 
same conclusion as in the case of the parameterization 
of top templates: the parameterization of the dijet mass 
templates is not very accurate and some bias is expected 
when the top mass and JES are reconstructed. 

The dijet mass template for background is built in the 
same way as for the ti templates. The background tem- 
plate is fitted to a normalized sum of two Gaussians and 
a Gamma function. This combination of functions pro- 
vided the best fit of the dijet mass shapes. Figure [15] 
shows separately the resulting parameterized curves of 
background templates for single and double tagged back- 
ground events. 



Likelihood definition 



N'(M top , JES) = YS^ k + ' JES 



k=0 



+q 3k+ 2 ■ JES 2 ) ■ {Mtopf 



(31) 



The parameters pi depend on M top and JES as follows: 

Pi = <73i+6 + <?3i+7 • M top + • JES, i = 0, 9 (32) 

In Eqs. I3T1 and l32l the parameters qi are constants deter- 
mined from the simultaneous fit of the top templates from 
all 84 it samples with the function Pf ' {m^ vt \M top , JES). 
Figure [H shows the function (mf vt \M top , JES) for 
M top — 170 GeVc 2 and various values of JES in the case 
of events with one tagged jet. A similar parameterization 
is obtained for events with at least two tagged jets. 

As in the case of top templates, we calculate (Eq. [25]) 
the quantity x 2 /Ndof to describe the performance of the 
parameterization of the dijet mass templates. We obtain 
X 2 /Ndof = 3551/2636 = 1.35 for the sample with one 
secondary vertex tag and x 2 / N dof = 2972/2524 = 1.18 
for the sample with at least two secondary vertex tags. 



The mass of the top quark and the value of JES are 
determined by maximizing a likelihood function built us- 
ing the two sets of templates described in Section IVI Al 
Assuming that the data sample is the sum of n s ti events 
and n b background events, we can calculate the likelihood 
function connected to a generic template P* as 



Cf{Mto P ,JES)= Yl 



f fn s -Pi(x evt \M top ,JES) 



n s + rib 

n b ■ P b f (x evt ) 
n s + n b 



(33) 



where index / can either be top when the variable x ev t 
represents the event- by- event reconstructed top mass, or 
W for the invariant mass of pairs of light flavor jets. 

The number of ti events, n s , is constrained to the ex- 
pected number of ti events, n e s xp , via a Gaussian 



C ns = exp - 



2<7„ 



(34) 
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Background Dijet Mass Template (1 b-tag) 
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FIG. 15: Dijet mass templates for (a) single tagged back- 
ground events and for (b) double tagged background events. 



tags: 



. r 



(36) 



As described in Section IIII1 the jet energy scale JES 
can be determined from independent detector calibra- 
tions. We include this knowledge in the likelihood in the 
form of a Gaussian constraint on our variable JES. This 
Gaussian has a mean equal to the expectation on JES 
from the independent calibration, JF<S exp = a c , and 
a width equal to 1 a c which is the uncertainty on this 
expectation. 



£jes = exp 



{JES - JES exp f 



(37) 



The term expressing the constraint on the JES vari- 
able is multiplied together with the likelihood function 
for each heavy flavor sample to obtain the final likeli- 
hood function used to reconstruct the top quark mass 
shown in Eq. 1381 



C-ltaq ' C-2taq 1 £ 



JES 



(38) 



Following the maximization of the likelihood function 
shown in Eq. [38] we will obtain six numbers: the recon- 
structed top quark mass M t , the reconstructed JES vari- 
able JES ou t, and the number of events with different 



number of tags for tt , 2j an d for background, 



'1,2 



The 



statistical uncertainties on these numbers, SM t , SJES out , 
5nf 2 1 and 8nf 2 are obtained from the points where the 
log-likelihood changes by 0.5. 



with mean equal to n e s xp and width equal to o" n «*p, the 
uncertainty on the expected number of ti events. 

The expected numbers of signal events, n e s xp , are 13 for 
the single tagged sample and 14 for the double tagged 
sample corresponding to a theoretical cross section of 
6. 7j^9 pb [28| and an integrated luminosity of 943 pb -1 . 
The value of the theoretical cross section assumes a top 
quark mass of 175 GeV/c 2 . The values for a n ^v are 
3.7 for the single tagged sample and 3.9 for the double 
tagged sample, which take into account both statistical 
effects (assuming a Poisson distribution) on n e s xp and sys- 
tematic ones based on the uncertainty on the theoretical 
cross section. 

The sum of tt and background events, n s + n&, is con- 
strained to the total number of observed events in the 
data, Nl°\ : via a Poisson probability with a mean equal 

toiv*°! 

r _ {K$) n * +n » exp(-A^f) 



Multiplying the terms expressing the constraints on 
the number of events and the likelihood functions for 
each template, we obtain separate likelihood functions 
for events with one tag and for events with at least two 



C. Calibration of the method 

Using samples of simulated ti events and the back- 
ground sample built based on the model presented in 
Section [VJ we form simulated experiments for a series 
of JES and M top input values. We then verify that the 
reconstructed values of the top quark mass and JES ob- 
tained following the maximization of the likelihood func- 
tion (Section IVIB[) are in agreement with the input val- 
ues. The simulated experiments are a mixture of ti events 
and background events reflecting the expected sample 
composition of the data. In each simulated experiment, 
the number of ti events is drawn from a Poisson distribu- 
tion of mean equal to the expected number of ti events 
passing the selection, as determined from simulation (Ta- 
ble IVIIip . The number of background events is also 
drawn from a Poisson distribution with a mean equal 
to the difference between the observed number of events 
(see Section IIII1 Table IJ) and the expected number of 
ti events. 

In order to reduce the statistical uncertainties on po- 
tential biases in mass or JES reconstruction, about 10,000 
simulated experiments are performed. Due to the finite 
size of simulated ti event samples and background sample 
the simulated experiments share events between them. 
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TABLE VIII: Number of events for samples of simulated 
ti events with M top ranging between 150 GeV/c 2 and 
200 GeV/c 2 . The numbers correspond to a integrated lumi- 
nosity of 943 pb _1 , after all selection requirements are made. 
The observed number of events is also shown. 



M top (GeV/c^) 


Single Tag 


Double Tag 


150 


18 


14 


155 


17 


15 


160 


16 


14 


165 


16 


14 


170 


15 


14 


175 


13 


14 


178 


14 


14 


180 


12 


13 


185 


11 


11 


190 


9 


11 


195 


9 


10 


200 


7 


8 


Total Observed 


48 


24 



1 



-1 

-2 
-3 



FIG. 16: JES versus Top Quark Mass plane. The points rep- 
resent the reconstructed JES, JES out, and top quark mass M t 
and have attached their corresponding statistical uncertain- 
ties, SJESout and 8M t . The vertical dashed lines correspond 
to the true values of the mass, while the horizontal lines corre- 
spond to the true values of JES. For a perfect reconstruction 
the points should sit right at the intersection of the dashed 
lines. 
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These overlaps result in correlations between the results 
of the mass and JES reconstructions from each simulated 
experiment. These correlations are taken into account 
following the study found in Ref. 36]. The typical value 
for the correlation between any two simulated experi- 
ments is 6%. 



The variables extracted from each simulated experi- 
ment are: the values of mass, Mf E , and JES, JES^ 
that maximize the likelihood defined in section [VI Bl the 
statistical uncertainties on the above variables, &Mf E 
and SJESg^ and the pulls for these variables as defined 
by 



PulL, 



Ml 



PE 



Mi 



Lop 



Pull 



JES. 



PE 



5M t PE 
J ESt, 



JES 



(39) 



where JESt rue is the value of JES used in the simulation. 

The distribution of the top quark masses Mf E recon- 
structed in each simulated experiment is fitted to a Gaus- 
sian. The mean of this Gaussian is interpreted as the re- 
constructed top quark mass of the sample, M t , while the 
width of the Gaussian represents the expected statistical 
uncertainty on it, SM t . We apply the same procedure to 
determine the reconstructed value of JES, JES ou t, and 
its expected statistical uncertainty, SJES ou t- 

Figure [TBI shows the reconstructed JES and the recon- 
structed top mass represented by the points, versus the 
true JES and true top mass represented by the grid. Ide- 
ally the points should match the grid crossings, but there 
is a slight bias which has to be removed. The bias is re- 
moved in the mass- JES plane by solving the system in 



Eq.gD] 



M t = C m + S, 
J ES ou t = C 



{M top - 175) 



Sj • J ESfrue 



(40) 



for M t0 p and JESt rue . The parameters C m , Cj, S m , and 
Si have the form 



C m — a l + a 2 ' JEStrue 

S m = a 3 + a 4 ' JEStrue 

Cj = 61 + 6 2 • Mtop 

Sj = 63 + 64 -Mtop 



(41) 



where the parameters {ai} and {bi} from Eq. [41] are 
listed in Table IIXI They are determined from a linear 
fit of the distributions of C m and S m versus JESt rMe 
(Figs. [17] and [18]) . and of Cj and Sj versus M top , re- 
spectively (Figs. [H and [20]) ■ 
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FIG. 17: Distribution of parameter C m (Eq. [41} as a function 
of JES. 
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FIG. 18: Distribution of parameter S m (Eg. I41|l as a function 
of JES. 



TABLE IX: Values of the parameters describing best the lin- 
ear dependence on the true JES and on the true Mt op , of the 
intercept and slope of the Mtop calibration curve and of the 
JES calibration curve respectively. 



Parameter 


Value 


Uncertainty 


ai 


175.0 


0.1 


a% 


-0.09 


0.05 


a 3 


0.975 


0.008 




0.016 


0.004 


h 


0.6 


0.3 


b 2 


-0.003 


0.002 


b 3 


1.35 


0.15 


b± 


-0.0021 


0.0008 
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FIG. 19: Distribution of parameter Cj (Eq. I41[l as a function 
of Mtop. 
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FIG. 20: Distribution of parameter Sj (Eq. I41|l as a function 
of Mtop. 



The uncertainties 8M t and 5JES ou t on the recon- 
structed values M t and JES out are also affected by the 
bias in the reconstruction technique and we need to cor- 
rect them as well. By differentiating Eq. 3D] with respect 
to M top and JESt, ue , we obtain another system of equa- 
tions to be solved for the corrected uncertainties, SM^ orr 



and SJESZ7- 



SM t = X m ■ SJES tr 
S JES out = Xj ■ 8M c t orr 



e + Y m - 5M t corr 
Yi ■ 8JESZ7 



(42) 



The parameters X m , Xj, Y m , and Yj from Eq. [42] de- 
pend on M top and JES trMe as shown in Eq. I43I Solving 
Eq. H2l provides the best estimate of the uncertainties on 
M t and on JES on f. 



X m = a2 + a 4 ■ (Mtop - 175) 

Y m = a 3 + a 4 ■ JEStr tie 

Xj = &2 + t>4 ' J EStrue 

Yj = b 3 + 64 • Mtop 



(43) 



Following the procedure for removing the bias in the 
mass reconstruction, the distribution of pull means ex- 
tracted using simulated experiments (Fig. [5TJ) validates 
our bias correction as, on average, the pull mean is esti- 
mated to be consistent with zero within the uncertainty. 
The width of the pull distribution is used to determine 
the corrections on the statistical uncertainties 6M£° rr 
due to non-Gaussian behavior of the likelihood function 
(Eq. [38]) . Figure [22] shows the mass pull widths versus 
top quark mass M t0 p- In these plots the JESt rMe of the 
tt samples is a c . Similar pulls are obtained from tt sam- 
ples with different values of JES tr „ e . Based on these fig- 
ures, it is estimated that the uncertainty on M t has to 
be increased by 11%. 

For the reconstruction of JES, Fig. |2"31 shows the pull 
means versus JESt rue , while Fig.HMlshows the pull widths 
versus JES true . In both plots, M top = 170 GeV/c 2 . Sim- 
ilar pulls are obtained from tt samples with different val- 
ues of M t0 p. Regarding the bias correction, we reach the 
same conclusion as in the case of the mass reconstruction 
that, on average, the pull mean is estimated to be consis- 
tent with zero within the uncertainties. Based on Fig. [24] 
it is estimated that the uncertainty on the JES OU f has to 
be increased by 6%. 

In order to further establish the robustness of the tech- 
nique, the mass and JES are measured in samples for 
which the true values are unknown to the authors of this 
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FIG. 21: Pull means versus Mt op in the case of the recon- 
struction of top quark mass in samples with JEStrue = rr c . 
The continuous line represents the average pull mean and the 
dashed lines show the uncertainty on it. 



FIG. 24: Pull widths versus versus JEStrue in the case of the 
reconstruction of JES in samples with M top = 170 GeV/c 2 . 
The continuous line represents the average pull width and the 
dashed lines show the uncertainty on it. 
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FIG. 22: Pull widths versus M top in the case of the recon- 
struction of top quark mass in samples with JEStrue = a c . 
The continuous line represents the average pull width and the 
dashed lines show the uncertainty on it. 
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FIG. 23: Pull means versus JEStrue in the case of the re- 
construction of JES in samples with Mt op = 170 GeV/c 2 . 
The continuous line represents the average pull mean and the 
dashed lines show the uncertainty on it. 



paper. To validate the mass reconstruction we utilize five 
such blind samples: three generated with Herwig and 
two with Pythia. The value of JES in these samples 
corresponds to 0, the nominal jet energy scale. The re- 
constructed top quark mass in each of these samples is 
the most probable value obtained from 10,000 simulated 
experiments. Each simulated experiment is formed com- 
bining the tt events in the blind samples and the back- 
ground events from the background model such that on 
average the total number of events is equal to the ob- 
served value (see Table IVIIip . The size of the tt content 
is 15 single tagged events and 14 double tagged events. 

Following the mass reconstruction technique and 
the calibration described in this paper, the differ- 
ences between the true top quark mass values and 
the reconstructed ones are: -0.2, 0.3, 0.6, -0.7, and 
1.1 GeV/c 2 . The statistical uncertainty on these num- 
bers is 0.8 GeV/c 2 . The first two numbers correspond 
to the Pythia samples. To validate the JES reconstruc- 
tion, another five blind samples are used for which the jet 
energy scale is modified. The generator used here is Her- 
WIG and the value of the top quark mass is 170 GeV/c 2 . 
The differences between the true JES values and the re- 
constructed ones are: 0.1, 0.3, 0.0, 0.1, and -0.1 a c . The 
statistical uncertainty on these numbers is 0.4 a c . 

In conclusion, both the mass and JES reconstructed 
values are compatible with true ones within the statistical 
uncertainties. This additional check gave us confidence 
that the method described here can be reliably applied 
on the data to reconstruct JES and the top quark mass. 



D. Expected statistical uncertainty 



In Fig. 12 
quark mass 
JES( rue = a c 



we show the expected uncertainty on top 



r5M t corr , 



versus M top , for samples with 
Since the expected number of tt events 
depends on the M top , the uncertainty 5M^ orr depends 
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removing the parameterization as a function of JES and 
by maximizing a likelihood built only with the top tem- 
plates corresponding to JES = <r c . We reconstruct the 
top quark mass for two samples with M top — 170 GeV/c 2 , 
but with different values for JESt rMe : +1 <r c , and -1 <r c , 
respectively. Taking half of the difference between the 
two reconstructed M t determines the systematic uncer- 
tainty due to jet energy scale as 2.2 GeV/c 2 , which is 10% 
more than in the case of using the in situ calibration and 
the JES parameterization. 



VII. RESULTS 



FIG. 25: Expected uncertainty on top quark mass, <5M t c °" , 
versus Mt op , for samples with JEStrue = a c . This uncer- 
tainty includes the uncertainty due to statistical effects and 
the systematic uncertainty due to jet energy scale. 
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FIG. 26: Expected uncertainty on JES, SJESout, versus 
JEStr^e, for samples with M top = 170 GeV/c 2 . 



on it too. Figure shows the expected uncertainty on 
JES, 5JES^ r , versus JES trne , for samples with M top 
= 170 GeV/c 2 . The uncertainties in Fig. [231 and I2TJ1 are 
corrected for bias, but not for pull widths (non-Gaussian 
effects). 

The expected uncertainties shown in Fig. [55] contain 
both the statistical uncertainty on the top quark mass 
and the uncertainty due to jet energy scale. In order to 
disentangle the statistical uncertainty on M t from the one 
due to jet energy scale, we reconstruct the top quark mass 
by maximizing the likelihood for a fixed value of JES. Fol- 
lowing this reconstruction for M top = 170 GeV/c 2 and 
JESt rue = cr c , the uncertainty on the top quark mass 
is 2.5 GeV/c 2 . In comparison, when JES is not fixed 
the expected uncertainty (Fig. [25]) on M t is 3.2 GeV/c 2 . 
Subtracting these two numbers in quadrature we esti- 
mate that the systematic uncertainty on M t due to jet 
energy scale is 2.0 GeV/c 2 . 

We can determine the systematic uncertainty on M t 
due to the jet energy scale in the absence of the in situ 
calibration (provided by the dijet mass templates), by 



Applying the event selection described in Section |TTT] 
to the multijet data corresponding to an integrated lu- 
minosity of 943 pb _1 , we observe 48 events with one sec- 
ondary vertex tag and 24 events with at least two sec- 
ondary vertex tags. Performing the likelihood maximiza- 
tion and applying the corrections described in Section [VTl 
for this sample, we measure a top quark mass of 171.1 ± 
3.7 GeV/c 2 and a value for JES of 0.5 ± 0.9 a c . 

Figure [2"T1 shows the distributions of reconstructed top 
quark masses for data (dots) and for the combination 
(light) of signal and background templates that best fit 
the data. The background (dark) contribution is shown 
normalized to the data as determined by the fractions 
obtained from the likelihood fit. There are two sets of 
distributions corresponding to the sample with only one 
secondary vertex tag fFig.l2"Wa)) and to the sample with 
at least two secondary vertex tags (Fig. l2"7T b)). 

The minimized negative log-likelihood is shown in 
Fig. [55] as a function of the top mass and JES after cor- 
recting for bias (Eqs. 201 and |4"2"|) and for non-Gaussian 
effects (Section IVI C[) . The central point corresponds to 
the minimum of the negative log-likelihood, while the 
contours are given at a number of values of AlnL, the 
change in negative log-likelihood from its minimum. 

Table [X] lists the number of events for ti and for back- 
ground for the one- and two-secondary vertex tags cases, 
as measured following the minimization of the two di- 
mensional likelihood of Eqn. [38] on the data. 

TABLE X: Measured sample composition of the multijet data 
sample for a luminosity of 943 pb _1 , passing the event selec- 
tion. The second column (1 tag) gives the number of events 
with only one secondary vertex tag, while the third column 
(>2 tags) is for the events with at least two secondary vertex 
tags. 



Number of Events 


1 tag 


>2 tags 


Signal (ft ) 


13.2 ± 3.7 


14.1 ± 3.4 


Background 


34.6 ± 7.2 


9.2 ± 4.3 


Total Observed 


48 


24 



Using a it Monte Carlo sample with a top quark mass 
equal to 170 GeV/c 2 and the number of signal and back- 
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FIG. 27: Reconstructed top mass for data (points), best sig- 
nal+background fit (light) and background shape from the 
best fit (dark): for (a) samplee with only one secondary ver- 
tex tag, and (b) the sample with at least two secondary vertex 
tags. 
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FIG. 28: Contours of the likelihood in the M top and JES 
plane at a number of values of AlnL, the change in negative 
log-likelihood from its maximum. 



ground events from Table [Xj we perform simulated ex- 
periments and determine the distribution of expected un- 
certainty on the top quark mass due to statistical effects 
and JES. About 41% of the simulated experiments have a 
combined uncertainty on the top quark mass lower than 



the measured value of 3.7 GeV/c 2 . This can be seen in 
Fig. [551 where the histogram shows the results of the sim- 
ulated experiments and the vertical line represents the 
measured uncertainty. In conclusion, the measured com- 
bined statistical and JES uncertainties on the top mass 
agree with the expectation. 
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FIG. 29: Distribution of expected statistical uncertainty on 
M t (histogram) and the measured uncertainty (vertical line). 
In about 41% of simulated experiments a statistical uncer- 
tainty on the top quark mass smaller than in the experiment 
is found. 



In order to obtain the contribution of the uncertainty 
in jet energy scale to the uncertainty on the top quark 
mass, the minimization of the 2D likelihood is modified 
such that the JES parameter is fixed to 0.5 a c (the value 
of JES from the likelihood minimization). Following this 
procedure the uncertainty on the top mass is 2.8 GeV/c 2 . 
Subtracting in quadrature this value from the uncertainty 
obtained when the JES was not fixed (3.7 GeV/c 2 ), we 
estimate the systematic uncertainty contributed by JES 
as 2.4 GeV/c 2 . 



VIII. SYSTEMATIC UNCERTAINTIES 

We model tt events using simulated events, which do 
not always accurately describe all effects we expect to see 
in the data. The major sources of uncertainties appear 
from our understanding of jet fragmentation, our mod- 
eling of the radiation from the initial or final partons, 
and our understanding of the proton and antiproton in- 
ternal structure. Apart from these uncertainties, which 
are present in most top quark measurements, we also ad- 
dress other issues specific to the present method such as 
the shape of the background top templates following the 
correction for tt content, and the uncertainty in the two 
dimensional correction of the reconstructed top mass and 
JES. 
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Systematic uncertainties related to jet energy 
scale 



1. b-jet energy scale 

We study the effect of the uncertainty on the modeling 
of b quarks due to the uncertainty in the semi-leptonic 
branching ratio, the modeling of the heavy flavor frag- 
mentation, and due to the color connection effects. 

To determine this we reconstruct the top mass in a 
simulated ti sample (M top — 175 GeV/c 2 ) where we select 
6-jets by matching the b quarks to a jet. The matching 
procedure requires \J (A?7) 2 + (Ac/)) 2 < 0.4 between the 
quark and the jet. We modify the energy of the 6-jets 
by 0.6% corresponding to the uncertainty on the 6-jet 
energy due to the effects listed above [37l]. The resulting 
systematic uncertainty on the top quark mass due to the 
uncertainty on the 6-jet energy scale is 0.4 GeV/c 2 . 



2. Residual jet energy scale 



TABLE XI: Residual jet energy scale uncertainty on the top 
mass. The sum in quadrature of all the effects represents the 
total residual systematic uncertainty due to jet energy scale. 

Source of Systematic 



SM t {GeV/c z ) 



Response Relative to Central Calorimeter 0.2 

Multiple Interactions 0.1 

Modeling of Hadron Jets 0.5 

Modeling of the Underlying Event 0.0 

Modeling of Parton Showers 0.5 

Energy Leakage 0.1 

Total Residual JES Uncertainty 0.7 



change in the value of the reconstructed top quark mass 
by 0.9 GeV/c 2 which is added as a systematic uncer- 
tainty. 



From the two dimensional fit for mass and JES, we ex- 
tract an uncertainty on the top quark mass that includes 
a statistical component as well as a systematic uncer- 
tainty due to the uncertainty on the jet energy scale. 
This systematic uncertainty is a global estimate of the 
uncertainty due to jet energy scale. Additional detailed 
effects arise from the limited understanding of the indi- 
vidual contributions to JES (see Section PVT]) . 

For this we have to study the effect on the top mass 
reconstruction from each of these sources: angular de- 
pendence of the calorimeter response, contributions by 
multiple interactions in the same event bunch, modeling 
of hadron jets, modeling of the underlying event, model- 
ing of parton showers and energy leakage. A simulated 
ti sample (M top = 175 GeV/c 2 ) is used where the en- 
ergies of the jets have been shifted up or down by the 
uncertainty at each level separately. We reconstruct the 
top quark mass for each case, without applying any con- 
straint on the value of JES. Table IXII shows the aver- 
age shift on the top mass at each level, and the sum in 
quadrature of these effects. We conclude from this study 
that the uncertainty on the top quark mass contributed 
by these corrections to the jet energy is 0.7 GeV/c 2 . 

B. Systematic uncertainties due to background 

1. Background modeling 

Based on the background model (Section [Vj) , we as- 
sume M top = 170 GeV/c 2 to correct for the presence 
of ti events in the background distributions. To esti- 
mate the uncertainty associated with making this as- 
sumption, we modify our background model considering 
a 10 GeV/c 2 variation on M top used in the default back- 
ground correction procedure. This variation results in a 



2. Background statistics 

Another effect we address here is that of the limited 
statistics (~ 2600 events, see Section [Vj of the data sam- 
ple used to model the background. To estimate this effect 
we vary the parameters describing the background tem- 
plates within their uncertainties. Using the procedure 
described below, we find that the effect on the recon- 
structed top quark mass due to variation on the back- 
ground dijet mass templates is negligible. This is not the 
case of the background top templates. 

For simplicity, we label the parameters of this template 
as Constant, Mean and Sigma, representing the constant, 
the mean and the width of the Gaussian function describ- 
ing the background top template. In order to find the 
uncertainties on these parameters, we vary the content 
of the top template histograms for background assuming 
that each bin fluctuates according to a Poisson probabil- 
ity. This variation is done 10,000 times, and each time 
we extract and form distributions with the values of the 
three parameters, Constant, Mean and Sigma after ap- 
plying the correction due to the residual ti content in the 
sample. We use the spread of these distributions as the 
uncertainties on the parameters of the top templates for 
background. 

Table IXIll shows the values of these uncertainties sepa- 
rately for the sample with only one secondary vertex tag 
(ltag) and for the sample with at least two secondary 
vertex tags (2tags). Varying the parameters of the back- 
ground top templates within these uncertainties results in 
a shift in the reconstructed top quark mass of 0.4 GeV/c 2 
and we add this as a new systematic uncertainty. 
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TABLE XII: Parameters of the top templates for background 
events. These templates have been described in Section TVIAI 
The second column is for the single tagged sample (ltag), 
while the third column is for the double tagged sample (2 
tags). 



Parameter 


1 tag 


2 tags 


Constant ((GeV/c^)" 1 ) 
Mean (GeV/c 2 ) 
Sigma ((GeV/c 2 ) 2 ) 


0.015±0.001 
159±3 
1790±272 


0.013±0.001 
163±3 
3280±712 



C. Initial and final state radiation 

The top quark mass measurement is affected by how 
we model the initial and final state gluon radiation. This 
radiation affects the jet multiplicity in the event as well 
as the energy of the jets, which in turn affect the top 
quark mass reconstruction. 

The amount of radiation from the initial partons is 
controlled in our simulated ti samples by the DGLAP 
evolution equation [38| [39]. The parameters of these 
equations are Aqcd and K (the scale of the transverse 
momentum for showering) . In the case of the initial state 
radiation, these parameters are tuned in the simulation 
to reflect the amount of radiation observed in Drell-Yan 
events 37]. The amount of radiation, proportional to the 
average transverse momentum of the leptons, is found to 
depend smoothly on the invariant mass of the leptons, 
over a range of energies extending up to the range of 
ti events. Two sets of values for the parameters Aqcd 
and K are determined to cover the variation of this de- 
pendence within one standard deviation (ctisr)- 

We generate two samples of ti events (M top — 
178 GeV/c 2 ) where the parameters Aqcd and K corre- 
spond to +(7isr (increase the amount of radiation), and 
—&ISR (decrease the amount of radiation), respectively. 
Using the default set of values, the reconstructed top 
quark mass is 178.6 GeV/c 2 . For the sample with +ctisr 
the reconstructed top quark mass is 178.9 GeV/c 2 , and 
for the one with —uisr the reconstructed top quark mass 
is 178.6 GeV/c 2 . Taking the maximum change in top 
mass, we quote 0.3 GeV/c 2 as the uncertainty due to 
initial state radiation modeling. 

Using the same variation of the parameters Aqcd and 
K to describe the variation of the final state radiation, 
we reconstruct the top quark mass to be 177.7 GeV/c 2 
in a sample with increased radiation and 177.4 GeV/c 2 
when we decrease the amount of radiation. Taking into 
account the value of the reconstructed top quark mass 
in the default case, the maximum change in the recon- 
structed top quark mass is 1.2 GeV/c 2 representing the 
systematic uncertainty on the modeling of the final state 
radiation. 



D. Proton and antiproton PDFs 

In our default simulation, the internal structures of 
the proton and antiproton are given by the CTEQ5L set 
of functions, and for a ti sample with M top = 178 GeV/c 2 
the reconstructed top quark mass is 178.6 GeV/c 2 . For 
the same M top value, using a different set of functions 
(cteq6m) results in a reconstructed top quark mass of 
178.7 GeV/c 2 . Within the CTEQ6M set, there are 20 inde- 
pendent parameters whose uncertainties are representa- 
tive of the uncertainty on the modeling of such structure 
functions [Z(|. Adding in quadrature all the 20 offsets 
observed in top quark mass reconstruction due to these 
variations, we get 0.4 GeV/c 2 . 

Also, it is known that the value of Aqcd has a direct 
effect on the shape of the structure functions. In order 
to estimate this effect, we chose yet another set of PDFs 
given by MRST, and reconstructed the top mass for 
Aqcd = 228 GeV to get a top mass of 177.4 GeV/c 2 , and 
for Aqcd = 300 GeV to get a top mass of 177.7 GeV/c 2 . 
Therefore the systematic uncertainty due to the value of 
Aqcd is 0.3 GeV/c 2 . 

Adding the two contributions in quadrature, we 
quote that the total systematic uncertainty due to the 
choice of structure functions of proton and antiproton is 
0.5 GeV/c 2 . 

E. Other systematic uncertainties 

The default Monte Carlo generator used to determine 
our templates is Herwig, which is known to differ from 
the Pythia generator. For simulated ti samples with 
Mtop — 178 GeV/c 2 , we reconstruct the top quark mass 
as 177.6 GeV/c 2 using Herwig as the generator, and 
178.6 GeV/c 2 using Pythia. We assign a systematic un- 
certainty due to the choice of the Monte Carlo generator 
of 1.0 GeV/c 2 representing the difference between the re- 
constructed top quark masses in Herwig and Pythia. 

In addition, we have varied the parameters of Eq. 0D] 
within their uncertainties as listed in Table ITXl and ob- 
tained new values of the top quark mass. The changes 
from the default value are within 0.2 GeV/c 2 . 

F. Summary of the systematic uncertainties 

The total systematic uncertainty on the top mass com- 
bining all the effects listed above is 2.1 GeV/c 2 . Ta- 
ble IXIIII summarizes all sources of systematic uncertain- 
ties with their individual contribution as well as the com- 
bined effect. 



IX. CONCLUSION 

We measure the mass of the top quark to be 
171.1 GeV/c 2 with a total uncertainty of 4.3 GeV/c 2 . 
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TABLE XIII: Summary of the systematic sources of uncer- 
tainty on the top mass. The sum in quadrature of all the 
effects represents the total systematic uncertainty. 



Source 



6-jet JES 
Residual JES 
Background Modeling 
Background Statistics 
Initial State Radiation 
Final State Radiation 

pp PDF Choice 
Pythia vs. Herwig 
Method Calibration 
Sample Composition 

Total 



Uncertainty (GeV/c z ) 



0.4 
0.7 
0.9 
0.4 
0.3 
1.2 
0.5 
1.0 
0.2 
0.1 

2.1 



quark mass measurement. 

As the luminosity collected with the CDF II detector 
increases to an expected 7 fb -1 for Run II, the statistical 
uncertainty on the top quark mass will improve and addi- 
tional top quark mass results from CDF are expected in 
the near future. A more careful estimation of the sources 
of systematic uncertainties on the top quark mass as well 

TABLE XIV: Most precise results from each ti decay channel 
from the Tevatron by March 2007. The integrated luminosity 
used in these analyses is about 1 fb _1 . 



Channel 



Result 



Lepton+Jets [3] 
Dilepton [4] 
All-hadronic (this analysis) 



All-hadronic (previous result) [2] 



170.9±2.5 GeV/c 2 
164.5±6.5 GeV/c 2 
171.1±4.3 GcV/c 2 
174.0±5.3 GeY/c z 



This measurement, the most precise to-date in the all- 
hadronic channel, is performed using 943 pb _1 of inte- 
grated luminosity collected with the CDF II detector. 
This is the first simultaneous measurement of the top 
quark mass and of the jet energy scale in the ti all- 
hadronic channel. It is also the first mass measurement in 
this channel that involved the use of the ti matrix clement 
in the event selection as well as in the mass measurement 
itself. 

The previous best mass measurement published in this 
channel, for an integrated luminosity of 1 fb _1 , has an 
equivalent total uncertainty of 5.3 GeV/c 2 [|J which is 
23% more than in this measurement. The main source 
for the observed improvement is the reduction of the 
uncertainty on the top quark mass due to jet energy 
scale (JES). In the present analysis, this uncertainty 
is 2.5 GeV/c 2 (including the residual JES uncertainty 
of 0.7 GeV/c 2 ), which is about twice smaller than the 
corresponding uncertainty of 4.5 GeV/c 2 determined in 
Ref. [|. 

The top quark mass measured in this analysis is 
consistent with the most precise top quark mass val- 
ues measured at the Tevatron and at CDF in the lep- 
ton+jets and the dilepton Q channels. This consis- 
tency among the decay channels restricts the possibil- 
ity for new physics to prefer the ti all-hadronic decay 
channel over the other decay channels. Table IXIVI sum- 
marizes the most precise top quark mass measurements 
made at the Tevatron using an integrated luminosity of 
about 1 fb _1 . From this table it can be seen that the all- 
hadronic channel provides the second most precise top 



as a more efficient ti event selection can help to further 
reduce the total uncertainty in this analysis. We expect 
that future mass measurements performed in this channel 
using an increased data sample size will improve the total 
uncertainty on the top quark mass which will contribute 
to our understanding of the electroweak interaction as 
well as to the search for new physics. 
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